Some Problems in Estonian Wordnet
Abstract
WordNet (WN) is an on-line lexical reference system whose design is inspired by current psycholinguistic theories of human lexical memory. Nouns, verbs, adjectives and adverbs are organised into synonym sets (synsets), each representing one underlying lexical concept, and different kinds of semantic relations link the synsets. WN is based on word meaning: all of the words that can express a given sense are grouped together in one synset (see also Vider K., Orav H. 1996. WORDNET: An On-line Lexical Database. Papers of the First Swiss-Estonian Student Workshop on Computational and Theoretical Linguistics: 64-68).
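The organisation described in the abstract (words grouped into synsets, synsets linked by semantic relations) can be illustrated with a minimal sketch. All names here (`Synset`, `MiniWordnet`, the ids `n1` and `n2`, the relation label `hypernym`) are hypothetical and chosen for illustration only; this is not the data model of WordNet or Estonian Wordnet.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Synset:
    """One lexical concept: the set of words that can express it (hypothetical model)."""
    synset_id: str
    pos: str                      # part of speech: "n", "v", "a" or "r"
    words: frozenset
    # relation name -> list of target synset ids
    relations: dict = field(default_factory=dict)


class MiniWordnet:
    """Toy container: synsets plus a word -> synset-id index."""

    def __init__(self):
        self.synsets = {}
        self.index = defaultdict(list)

    def add(self, synset):
        self.synsets[synset.synset_id] = synset
        for word in synset.words:
            self.index[word].append(synset.synset_id)

    def link(self, source_id, relation, target_id):
        # Semantic relations hold between synsets, not between individual words.
        self.synsets[source_id].relations.setdefault(relation, []).append(target_id)

    def senses(self, word):
        # .get avoids creating empty index entries for unknown words.
        return [self.synsets[i] for i in self.index.get(word, [])]


wn = MiniWordnet()
wn.add(Synset("n1", "n", frozenset({"car", "automobile"})))
wn.add(Synset("n2", "n", frozenset({"vehicle"})))
wn.link("n1", "hypernym", "n2")
```

In this sketch, looking up either "car" or "automobile" leads to the same synset, which is what "based on word meaning" amounts to in practice: the concept, not the word form, is the unit that carries relations.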
Similar resources
Concerning the Difference Between a Conception and its Application in the Case of the Estonian WordNet
One source for the Estonian WordNet has been corpora of Estonian. We have also become interested in word sense disambiguation, and about 100,000 words in the corpora have been manually disambiguated according to Estonian WordNet senses. The aim of this paper is to explain some theoretical problems that “do not work well in practice”. These include the differentiation of word senses, metaphors, and conc...
First steps in checking and comparing Princeton WordNet and Estonian Wordnet
Each expanding and developing system requires some feedback to evaluate the normal trends of the system and also the unsystematic steps. In this paper two lexical-semantic databases, Princeton WordNet (PrWN) and Estonian Wordnet (EstWN), are examined from the visualization point of view. The visualization method is described, and the aim is to find and point out possible problems of synset...
The Revision History of Estonian Wordnet
Estonian Wordnet started in 1998, and since then a new version has been released at least once a year. The paper gives an overview of the main indicators for every year and highlights some problems that have been solved or have yet to be solved.
Automatic WSD: Does it Make Sense of Estonian?
This paper describes a fully automatic Estonian word sense disambiguation system called semyhe, which is based on Estonian WordNet (EstWN) hyponym/hypernym hierarchies and is meant to disambiguate both nouns and verbs. 1 Short description of the system. The main inspiration for our system is the similar system of Agirre and Rigau (1996), which disambiguates English noun senses based on WordNet hyponym/hy...
Dense Components in the Structure of Wordnet
This paper introduces a test pattern, named a dense component, for checking inconsistencies in the hierarchical structure of a wordnet. A dense component (viewed as a substructure) points out cases of regular polysemy in the context of multiple inheritance. The definition of regular polysemy is revised: lexical concepts (synsets) are used instead of lexical units. All dense components ar...